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1.  SUMMARY  OF  PAST  RESEARCH 


In  the  first  year  of  funding  our  research  focused  on  classification  tasks  under  uncertainty,  with 
special  emphasis  on  propagating  beliefs  in  a  system  containing  a  mixture  of  categorical  and  pro¬ 
babilistic  relationships.  Previously  available  techniques  were  able  to  handle  taxonomic  trees, 
where  the  main  relationship  is  ISA  -  i.e.,  class  membership.  The  knowledge  involved  in  object 
recognition  contains  non-binary  relationships  (e.g.,  ^-BETWEEN)  arranged  in  non- 
decomposable  structures.  The  difficulties  encountered  stem  from  the  incompleteness  of  the 
model.  In  other  words,  we  normally  have  information  about  the  relationship  between  a  variable 
and  each  of  its  neighbors  but  not  between  a  variable  and  all  of  its  neighbors  taken  together.  This 
precludes  the  construction  of  a  complete  Bayesian  model.  To  overcome  this  difficulty,  we  have 
developed  a  new  formulation  of  the  Dempster-Shafer  theory  in  terms  of  a  static  constraint- 
networks  (representing  stable  knowledge)  bombarded  by  randomly  fluctuating  constraints, 
(representing  uncenain  evidence)  (Appendix  I).  We  have  also  devised  a  scheme  for  computing 
Dempster-Shafer  belief  functions  in  that  model,  using  Assumption-Based  Truth  Maintenance 
Systems  (ATMS)  and  Incidence  Calculus  (Appendix  I).  ^ _ 

As  an  alternative  to  the  Dempster-Shafer  theory,  we  have  investigated  another  scheme  of 
handling  panial  models,  based  on  the  observation  that  facts  and  rules  are  usually  communicated 
with  virtual  certainty,  without  attaching  to  them  numeric  measures  of  belief.  Qualitative 
representations  involve  simplicity  of  elicitation,  communication,  encoding,  and  computation. 
They  suffer  however  from  the  ills  of  classical  monotonic  logics  which  preclude  the  representa¬ 
tion  of  context-dependent  information.  In  Appendix  II  we  describe  a  system  which  retains  the 
context-dependent  nature  of  probabilistic  inferences,  yet  involves  non-numeric  propositions.  In¬ 
put  knowledge  is  cast  in  statements  with  four  levels  of  certainty:  true,  false,  likely  and  unlikely. 
However,  unlike  classical  multivalued  logics,  the  system  retains  the  essential  properties  of  pro¬ 
babilistic  inferences,  such  as  conditionalization,  independence  and  causality.  This  allow  us  to 
manipulate  evidence  by  logical  means  while  maintaining  the  probabilistic  semantics  (hence  the 
plausibility)  of  the  output  statements,  cast  in  the  same  4-value  vocabulary. 


Appendix  II  contains  examples  of  property  inheritance  relationships,  and  shows  how  the 
inference  scheme  handles  them  satisfactorily.  The  scheme  is  ripe  for  applications  involving  ob¬ 
ject  recognition,  but  further  study  is  required  to  identify  what  relationships  are  inherited  across 
spatial  relationships.  For  example,  only  some  of  the  properties  of  an  object  are  inherited  by  its 
pans  and,  vice  versa,  only  few  of  the  part’s  properties  are  inherited  by  the  object  as  a  whole. 


We  are  currently  studying  another  approach  to  handle  the  mixture  of  probabilistic  and 
categorical  relationships  for  object  recognition  tasks,  based  on  the  maximum-entropy  principle. 
This  approach  anempts  to  complete  the  nodel  and,  unlike  conventional  methods  of  maximum 
entropy,  exploits  the  "almost  categorical"  nature  of  spatial  constraints  to  extract  a  computational¬ 
ly  tractable  calculus  of  property  inheritance. 
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2.  LIST  OF  RESULTS 


•  An  axiomatic  system  (called  Graphoids)  was  developed,  which  provides  a  formal  charac¬ 
terization  of  informational  dependencies  and  their  graphical  representations. 

•  Techniques  were  developed  for  learning  causal  structures  from  emprirical  data. 

•  Relevance-based  control  strategies  were  developed  to  optimize  the  selection  of  tests  and 
to  minimize  the  network  activity  during  updating. 

•  The  interrelationships  between  the  Bayesian  and  Dempster-Shafer  approaches  to  uncer¬ 
tainty  were  given  formal  definition  in  terms  of  provability  conditions  and  constraint  net¬ 
works. 

•  Probabilistic  semantics  was  developed  for  a  subset  of  default  reasoning,  leading  to  con- 
sistencj'  criteria,  sound  inferences  and  the  qualitative  management  of  causality. 

•  Sound  and  complete  graphical  procedures  for  identifying  the  set  of  parameters  needed  for 
answering  a  given  probabilistic  query. 

•  A  proof  that  d-separation  is  the  most  complete  graphical  criterion  for  detecting  condi¬ 
tional  independencies  in  influence  diagrams. 

These  results  are  described  in  the  following  publications. 
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Abstract 

A  new  system  of  defeasible  inference  is  presented.  The  system  is  made  up  of  a  body 
of  six  rules  which  allow  proofs  to  be  constructed  very  much  lihe  in  natural  deduction 
systems.  Five  of  the  rules  are  shown  to  possess  a  sound  and  clear  probabilistic  seman¬ 
tics  that  guarantees  the  high  probability  of  the  conclusion  given  the  high  probability 
of  the  premises.  The  sixth  rule  appeals  to  a  notion  of  irrelevance;  we  explain  both  its 
motivation  and  use.* 


1  Motivation 

Belief  commitment  and  belief  revision  are  two  distinctive  characteristics  of  common  sense 
reasoning  which  have  so  far  resisted  satisfactory  formal  accounts.  ClassictJ  logic  for  in¬ 
stance.  cannot  accommodate  belief  revision:  new  information  can  only  add  new  theorems. 
Probability  theory,  on  the  other  hand,  heis  difficxilties  in  accommodating  behef  commit¬ 
ment:  propositions  are  believed  only  to  a  certain  degree  which  dynamically  changes  with 
the  acquisition  of  new  information. 

Recent  years  have  witnessed  a  renovated  effort  to  enhance  these  formalisms  in  order  to 
overcome  such  hmitations.  Those  %vorking  within  the  probabilistic  framework  have  tried  to 
de^^se  ‘acceptance  rules’  to  work  on  top  of  a  body  of  probabilistic  knowledge,  as  to  create 
a  body  of  beheved,  though  defeasible,  set  of  propositions  (e.g.  [Levi  80]).  Those  working 
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within  the  logic  framework  have  developed  ‘non-monotonic’  inference  systems  [AI  Journal 
80]  ba^ed  on  classical  logic,  in  which  old  ‘theorems’  can  be  defeated  by  new  ‘axioms’. 

In  comparison,  the  probabilistic  approach  has  enjoyed  a  significant  advantage  over 
the  logical  approach.  A  body  of  probabilistic  knowledge  together  with  an  acceptance  rule 
uniquely  determines  the  conclusions  that  can  be  derived.  Both  the  probabilistic  knowledge 
base  and  the  acceptance  rule  can  be  modified  so  as  to  capture  those  conclusions  that  appear 
reasonable.  Non  monotonic  logics,  on  the  other  hand,  have  lacked  such  clear  semantics. 
Not  only  it  has  been  difficult  to  tune  the  set  of  defeasible  rules  so  as  to  ‘entail’  the  desired 
conclusions  [see  Hzmks  and  McDermott  86],  but  it  has  even  been  difficult  to  characterize 
what  the  conclusions  sanctioned  by  a  body  of  ‘defaults’  ought  to  be  (see  [Touretzky  et  al. 
87],  “A  clash  of  intuitions  ...”). 

On  the  positive  side,  as  noted  by  [Glymour  et.  al.  84]  and  [Loui  87a],  the  logical 
approach  has  shown  that  a  qualitative  account  of  non-monotonic  reasoning,  which  does  not 
require  either  ‘acceptance  rules’  or  the  expense  auad  precision  of  computing  with  numbers, 
might  be  possible,  and  has  even  suggested  ways  in  which  such  an  account  can  be  provided. 

In  this  paper  we  attempt  to  show  that  it  is  possible  to  combine  the  best  of  both  worlds. 
We  present  a  system  of  defeasible  inference  which  operates  very  much  like  natural  deduction 
systems  in  logic  and,  yet,  can  be  justified  on  probabilistic  grounds.  The  resulting  system  is 
closely  related  to  the  logic  of  conditionals  developed  by  Adams  [Adams  66],  as  we  interpret 
defaults  of  the  form  P  —*  Q  as  constraining  the  conditional  probability  of  Q  given  P  to 
be  infinitesimally  close  to  one.  On  the  other  hand,  the  appeal  to  a  notion  of  relevance  in 
our  formulation  bears  a  close  relationship  to  those  approaches  which  investigate  defeasible 
reasoning  as  resulting  from  the  interaction  of  competing  arguments  (e.g.  [Touretzky  84: 
Poole  85;  Loui  87b;  Pollock  87]). 

The  structure  of  the  paper  is  as  follows.  In  section  2  we  define  the  core  of  the  system, 
discuss  the  need  for  providing  am  account  of  the  notion  of  irrelevance,  amd  present  such 
am  account.  In  section  3  we  illustrate  the  applicability  of  the  system  proposed  by  going 
through  am  stamdau:d  set  of  examples.  We  summairize  the  main  contributions  in  section  4. 


2  A  System  of  Defeasible  Inference 

2.1  The  Core 

The  logic  we  shall  present  will  be  referred  ais  L  and  will  be  characterized  by  a  set  of  rules  of 
inference,  in  the  style  of  naturad  deduction  systems.  The  goal  of  L  is  to  sanction  ais  theorems 
the  highly  likely  consequences  that  follow  from  a  given  context.  A  context  P  =  Ek  is 
defined  by  a  background  context  A',  which  expresses  generic  knowledge  relevant  to  the 
domain  of  discourse,  amd  an  evndentiail  set  A,  which  expresses  the  paxticulair  facts  which 
characterize  the  pajticulax  situation  of  interest.  A  background  context  K,  K  =  {L,D),  is 
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built  from  a  set  of  closed  wffs  L  2ind  a  set  D  of  defaults,  represented  by  meta^linguistic 
expressions  of  the  form  p  —*  q,  where  p  and  q  are  closed  wffs.  The  evidential  set  E  is  given 
by  a  collection  of  closed  ^vffs.  We  use  default  schemas  of  the  form  P(x)  — >  Q(x),  where  P 
and  Q  are  wffs  with  free  variables  among  those  of  x  =  {xi, . . . ,  x„},  to  represent  the  infinite 
collection  of  defaults  that  results  from  substituting  x  by  a  vector  a  of  ground  terms. 

The  system  of  inference  implicitly  defines  the  set  of  conclusions  h  that  follow  from  a 
given  context  p/,-.  with  K  =  {L,D).  We  write  E  ^  h  to  denote  that  sentence  h  is  derivable 
from  context  Ek  in  L.  Likewise,  E,  {/}  h,  abbreviated  E,f^h,  states  that  h  is  derivable 
from  the  context  that  results  from  adding  the  sentence  /  to  E.  We  shsdl  use  the  notation 
o{E)  to  refer  the  sentence  that  obtains  by  conjoining  the  sentences  in  E.  The  symbol  h 
stands  for  derivabihty  in  claissical  first  order  logic.  The  initial  set  of  rules  we  axe  going  to 
consider  is  the  following: 

Rule  1  (Defaults)  If  f  h  e  D  then  /  1)^  h 

Rule  2  (Logic  theorems)  If  £  U  fj  b  h  then  E  ^  h 

Rule  3  (Triangularity)  If  £  /  and  E  ^  h  then  E,f\j^h 

Rule  4  (Bayes)  If  £  /  and  E.  f  then  £  h 

Rule  5  (Disjunction)  If  £,  /  h  and  E,g  ^  h  then  £,  /  V  p  /i 

Rule  1  permits  us  to  conclude  the  consequent  of  a  default  when  its  antecedent  is  all 
that  has  been  learned.  Rule  2  states  that  theorems  that  logically  follow  from  a  set  of 
formulas  can  be  concluded  in  euiy  context  containing  those  formulas.  Rule  3  permits  the 
incorporation  of  an  estabhshed  conclusion  to  the  current  evidence  set,  without  affecting 
the  status  of  any  other  derived  conclusions.  Rule  4  says  that  any  conclusion  that  follows 
from  a  context  whose  evidence  set  was  augmented  with  a  conclusion  estabhshed  in  that 
context,  also  follows  from  the  original  context  alone.  Finally,  rule  5  permits  reasoning  by 
Ccises. 

Rules  2-5  can  be  shown  to  share  the  inferential  power  of  the  system  of  rules  proposed  by 
Adeims  in  [Adams  66]  for  deriving  what  he  calls  the  probabihstic  consequences  of  a  given  set 
of  conditionals.  They  also  appear,  in  different  form,  in  the  logic  of  conditionals  developed 
by  Stalnziker,  Lewis  and  others  during  the  seventies^.  Interestingly,  in  the  context  of  non¬ 
monotonic  logics,  Gabbay  [Gabbay  85]  has  als..  proposed  a  minimal  set  of  rules  which 
includes  rules  3  and  4  above. 

We  proceed  now  to  investigate  some  of  the  properties  of  the  system  of  defeasible  infer¬ 
ence  defined  by  rules  1-5.  Later  on.  we  shall  discuss  some  of  its  hmitations  as  we  enhance 
the  system  with  a  sixth  rule  which  attempts  to  capture  the  notion  of  irrelevance. 

^See  [N'ute  84]  for  a  review. 
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2.1.1  Some  Meta-Theorems 

Theorem  1  (Logical  Closure)  If  £"  /i,  £  l}j  h',  and  E,  h,  h'  h  h",  then  E  Ij^  h". 

By  rule  3,  we  obtain  E,h\pr  h'.  From  rule  2,  we  get  E,  /i,  h'  h".  Applying  rule  4  twice, 
the  theorem  is  proved. 

Theorem  2  (Equivalent  Contexts)  If  E  Ijj  /i  and  <^(E)  =  <f>{E')  then  E'  ^  h  . 

Since  E  I-  <Z>(E'),  by  applying  rules  2  and  3  we  get  E,  E'  ^  h;  which  together  with 
E'  h  (p{E)  and  rules  2  and  4,  leads  to  E'  Ijj  h. 

Theorem  3  (Exceptions)  If  E  tjj.  /i  and  E,f^~‘h  then  E 

From  E,f^  ->h,  we  can  obtaiin  by  theorem  1,  E,  /  Ijj-  ->/i  V  ->/.  On  the  other  hand,  from 
rule  2  we  can  conclude  E,  ->/  ~'/i  V  ->/.  Combining  these  two  results  by  means  of  rule 

5  and  theorem  2,  we  get  E  ~'h  V  ->/  and,  therefore,  E  by  virtue  of  theorem  1  and 

E  ^  h. 

Some  non- theorems; 

E  f  and  f  ^  h  does  not  necessarily  imply  E  ^  h 
E  and  E'  ^  h  does  not  necessarily  imply  E,E'  ^  h 

Note  that  the  first  non-theorem  is  clearly  undesirable.  If  accepted,  it  would  endow  our 
system  with  monotonic  characteristics  of  classical  logic,  precluding  exceptions  hke  non¬ 
flying  birds,  etc.  The  second  one  would  incorrectly  authorize  to  conclude  for  instance,  that 
John  will  be  happy  when  married  to  both  a  Jeme  aind  Mary,  on  the  grounds  that  he  will 
be  happy  when  married  to  either  one  of  them. 

.As  we  shall  see  later,  the  system  of  rules  1-5  defines  an  extremely  conservative  non¬ 
monotonic  logic.  In  fact,  the  inferences  sanctioned  by  these  rules  do  no  involve  any  type 
of  assumptions  regarding  information  absent  from  the  background  context.  To  illustrate 
this  fact,  let  K  =  {L,D)  and  K'  =  {L',D')  denote  two  background  contexts,  such  that 
K  <  K',  i.e.  L  C  L'  amd  D  C  D'.  We  have  the  following  theorem: 

Theorem  4  (K-monotonicity)  If  E  Ijj  h  and  K  <  K'  then  E  Ijj/  h. 

This  theorem  follows  easily  by  induction  on  the  minimal  length  n  of  the  derivation  of 
E  ^  h.  If  n  =  1,  it  means  that  h  weis  derived  from  E  in  K  either  by  rule  1  or  by  rtile  2. 
In  either  case  it  is  easy  to  show  that  h  can  be  derived  from  E  in  K'.  Let  us  assume  now 
that  h  is  derivable  from  E  in  K  in  n  steps,  n  >  1,  and  that  the  theorem  holds  for  all  the 
proofs  with  length  m  <  n.  Clearly  the  last  step  in  the  derivation  must  involve  one  of  the 
rules  3-5.  In  any  case,  the  antecedents  of  such  rule  must  be  derivable  in  a  number  of  steps 
smellier  than  n  and,  therefore,  by  the  inductive  assumption,  they  are  also  derivable  in  A'', 
from  which  it  follows  that,  using  the  same  rule,  h  is  eilso  derivable  from  E  in  A''. 

Finally,  rules  1-5  can  be  shown  to  be  probabilistically  sound.  That  is,  if  we  interpret 
defaults  of  the  form  p  —*  q  as  constraining  the  conditioned  probability  of  q  given  p  to  be 
infinitesimally  close  to  1,  we  can  show  that  each  rule  preserves  the  interpretation  which 
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assigns  a  conditional  probability  P{h\E,L)  infinitesimally  closed  to  1  to  expressions  of  the 
form  E  h.  We  omit  the  proof  here,  and  refer  the  interested  reader  to  [Adams  66]  and 
[Pearl  et.  al.  88]. 

We  now  turn  our  attention  to  an  exzimple  that  shows  how  the  body  of  rules  introduced 
so  far  can  account  for  simple  patterns  of  non- monotonic  reasoning. 


2.2  Example 

Example  1.  Let  us  consider  the  background  context  K  =  {L,D)  depicted  in  fig.  1  with 
L  =  {'ix.penguin{x)  D  bird{x)},  D  =  {penguin{x)  — »  -^flies{x),bird{x)  — >  flies{x)]. 

bird - ►  flies 

\/ 

penguin 

Figure  1;  The  penguin  triangle 

Concluding  that  ‘Tim  does  not  fly’  in  context  K  knowing  that  ‘Tim  is 
amounts  to  pro\'ing  penguin(Tim)  ->flie3(Tim),  The  proof  gets  reduced 
application  of  rule  1,  since  penguin{Tim)  —*  -<f\its[Tim)  e  D. 

Proving  ptnguin{Tim),bird{Tim)  -<flies{Tim)  is  slightly  different  since 
bird{Tim),  needs  to  be  assimilated.  The  proof  goes  as  follows:^ 

1.  penguin{Tim)  ^ ->flie3(Tim)  rule  1 

2.  penguin{Tim)  ^  hird{Tim)  rule  2 

3.  penguin{Tim),bird{Tim)  ^  -'flies{Tim)  rtile  3;  1,  2. 

Note  that  the  new  piece  of  information  available,  bird{Tim),  does  not  alter  the  conse¬ 
quences  that  followed  from  the  former  context,  as  such  new  piece  of  information  can  be 
shown  to  be  itself  one  of  its  consequences. 

It  is  important  to  note,  that  the  proper  handling  of  this  simple  hiereurchy  in  the  logic, 
without  the  need  to  exphcitly  encode  exceptions  (like  that  ‘penguins’  are  ‘abnormzd’  birds 
with  respect  to  flying),  arises  not  only  from  the  interpretation  of  defaults  embodied  in  the 
rules,  but  also  from  the  distinction  made  in  L  between  the  formulas  in  the  background 
context  K  from  those  in  the  evidential  set  E. 

To  illustrate  this  last  point,  let  us  consider  the  new  context 

^Proofs  appear  as  a  sequence  of  lines.  Each  formula  in  a  proof  has  associated  both  a  number  and  a 
justification.  The  latter  indicates  the  rule  used  in  deriving  the  formula,  as  well  as  the  conditions  that  make 
the  rule  applicable. 


a  penguin’ 
to  a  single 

a  new  fact. 
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r'l  =  {p€nguin(Tim),Vx.  penguin{x)  D  bird(x)}K'', 
with  A"  =  {L\D'),  I'  =  {}  and 


D'  =  {penguin(x)  — »  ~'flies{x),bird{x)  —*  flies{x)}, 

which  results  from  Fi  by  moving  the  class  inclusion  PB  =  Vx. penguin{x)  D  btrd(x)  from 
the  background  context  to  the  evidential  set.  We  find  that,  even  though  both  Fi  and  Fj 
comprise  the  same  wffs  and  defaults,  the  formula  -’//zes(T'im),  derivable  in  context  Fj  is 
not  derivable  in  F', .  That  is,  the  ‘preference’  for  the  conclusion  that  penguins  do  not  fly  in 
spite  of  being  birds,  is  not  explained  in  our  framework  solely  in  terms  of  class  specificity, 
but  aJso  in  terms  of  the  knowledge  presumed  by  the  default  rules. 

While  the  default  penguin(Tim)  — *  -> flies{Tim)  is  interpreted  in  K  as  asserting  a  con¬ 
ditional  probability  Ph:{^flies(Tim)\penguin(Tim),PB,L')  infinitesimally  close  to  one: 
the  same  default  is  interpreted  in  K'  as  asserting  the  rather  different  conditional  probabil¬ 
ity  Pi^i{~' flies{Tim)\pf‘nguin{Tim),  L')  to  be  infinitesimally  close  to  one.  In  order  words, 
in  A',  the  default  'penguins  don’t  fly’  zilready  presumes  penguins  to  be  birds.  In  K'  in¬ 
stead,  the  latter  fact  is  taken  to  represent  a  new  piece  of  knowledge,  independent  of  the 
background  knowledge  used  to  assume  that  most  penguins  do  not  fly,  and  which  happens 
to  support  the  opposite  conclusion. 

This  example  also  shows  that  formulas  cannot  be  freely  moved  between  the  background 
context  and  the  evidence  set  without  altering  the  meaning  of  the  theory  they  define. 
Propositions  in  a  background  context  A'  represent  generic  knowledge  presumed  by  all 
the  defaults  in  K.  Unhke  formulas  in  the  evidence  set,  they  do  not  represent  pieces  of 
evidence  that  need  to  be  2issimilated  in  order  to  reach  a  conclusion.  That  is  actually  the 
proof  theoretic  significance  of  rule  1. 


2.3  Irrelevance 

The  common  interpretation  of  defaults  of  the  type  a  —♦  6  is  in  the  form  of  a  disposition 
to  beheve  b  when  a  is  beheved  and  no  reason  for  not  doing  so  is  apparent.  This  reading 
has  two  implications  we  shall  be  concerned  with:  one  which  requires  conclusions  to  be 
retractable  in  the  light  of  new  refuting  evidence:  the  second  which  requires  conclusions  to 
persist  in  the  light  of  new  but  irrelevant  evidence.  Rules  1-5  excel  at  the  first  requirement: 
their  soundness  prevents  preserving  a  conclusion  in  a  context  in  which  its  high  probability 
ceinnot  be  gueiranteed.  In  example  1  we  have  shown,  for  instance,  that  whii  birds  can 
be  assumed  to  fly,  birds  known  to  be  penguins  cannot.  On  the  other  hand,  it  is  easy  to 
discover  that  the  same  body  of  rules  fail  miserably  in  the  second  aspect.  For  instance, 
given  a  background  context  K  =  {L.D)  with  £  =  {}  and  D  =  {a  —*  b],  rule  1  permits  the 
conclusion  a  b.  However,  if  a  new  piece  of  information  e,  that  bears  no  relation  to  b  is 
discovered,  rules  1-5  fail  to  prove  a,e  \^b  and.  therefore,  to  maintain  the  belief  in  b  in  the 
new  context. 
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This  'conservatism'  arises  as  no  surprise  from  a  set  of  rules  which  insist  on  probabilistic 
soundness:  while  there  is  no  reason  to  beheve  that  the  presence  of  e  in  the  context 
could  render  6  less  Ukely,  such  a  situation  would  be  perfectly  consistent  and,  since  a  sound 
conclusion  must  hold  in  every  probabilistic  model  of  /t ,  a,  e  1;^  6  is  not  sound  and,  therefore, 
not  provable  from  rules  1-5. Furthermore,  closer  inspection  of  rules  1-5  reveals  that  the 
only  type  of  evidence  that  can  be  assimilated  without  affecting  the  status  of  a  derived 
conclusion  is  evidence  which  is  subsumed  by  older  information  (hke  in  example  1,  in  which 
we  learn'  that  Tim  is  a  bird  after  knowing  he  is  a  penguin). 

Clearly  this  is  insufficient.  If  we  want  the  system  to  exhibit  retisonable  inferences,  Uke 
the  one  illustrated  by  the  example  above,  we  need  to  restrict  the  family  of  probabihstic 
models  relative  to  which  a  given  conclusion  must  be  checked  for  soundness.  We  want  those 
models  to  embed  the  common  sense  assumption  that  no  conclusion  should  be  retracted 
when  there  is  no  apparent  reason  for  doing  so. 

Our  attempt  will  be  precisely  to  provide  a  formal  account  of  what  these  re?isons  are, 
and  to  define  from  it  the  conditions  under  which  a  default  a  b  can  be  assumed  to  hold 
in  a  given  context  Ef^-.  In  probabihstic  terms,  to  specify  the  conditions  under  which  b  can 
be  assumed  to  be  conditionally  independent  on  E  given  a  and  K. 

The  idea  we  shall  pursue  is  simple.  We  shall  essentially  assume  that  E  provides  a 
reason  for  a  sentence  h.  an  argument  in  our  terminology*,  when  there  is  a  proof  for  E  ^  h 
that  is  logically  consistent  with  what  we  know.  That  is,  in  order  to  verify  whether  E  might 
support  h  ia  K  =  {L.D}.  we  assume  a  given  subset  of  defaults  S,  S  C  P,  to  hold,  and 
test  whether,  under  such  conditions,  there  is  a  derivation  of  h  from  E  amd  L. 

Formailly,  we  say  that  there  is  an  argument  for  h  with  support  S  from  a  set  E  of  wffs. 
iff  E  Ifj  /?  is  derivable  according  to  the  following  rules: 

Rule  A.l  If  /  — ►  /i  e  5  then  E.  f  llj  h.  for  any  set  E  of  sentences 

Rule  A. 2  U  E  h  then  E  \\^  h 

Rule  A. 3  If  E  1^5  /  and  E.  f  h  *^hen  E  h 

Rule  A. 4  If  E,  f  h  and  E.g  h  then  E.  f  y  g  h 

Note  that  except  for  rule  A.l.  which  relaxes  rule  1  above,  the  rest  of  the  rules  precisely 
correspond  to  rules  2.4  &:  5  above.  Furthermore,  the  rule  which  would  correspond  to  rule 
3  above  turns  out  to  be  redundant:  unlike  provabihty  in  L,  'arguabUity'  —  the  provabihty 
relation  aissociated  with  the  symbol  '  llj  — -is  monotonic.  That  is.  we  can  show  that  if 

.Another  way  of  looking  at  this  e.xample  is  by  considering  the  background  conte.xt  A''  =  (A'.  D')  >  A  = 

{L.  D).  with  L'  =  {}  and  D'  =  {a  —  6.  aAe - '*}.  Clearly  K'  does  not  permit  the  conclusion  6  from  a 

and  e.  However,  if  A'  sanct  ■  .ed  such  a  conclusion,  so  should  A'',  in  light  of  the  K-monotonicity  of  the  rules 
(Theorem  4). 

*As  borrowed  from  [Loui  f^Tb].  [Pollock  ST]  and  others. 


E  h  holds,  so  does  E.  £'  llj  /?. 

Now,  in  order  to  relate  the  arguabiUty  of  a  sentence  h  with  the  derivabihty  of  E,  L  It;  h, 
for  some  S  Q  D,  we  need  to  guarantee  that  we  have  not  incurred  any  inconsistency  by 
simultaneously  assuming  all  the  defaults  in  5  to  hold.  We  say  that  a  support  S  is  consistent 
in  context  Ef^.  if  for  no  sentence  /,  we  have  £,  £  II;  /  A  -if.  In  such  case,  if  we  can  prove 
E.  L  It;  /i  for  a  sentence  h.  vve  say  that  h  is  arguable  in  context  Ef^.^ 

For  instance,  figure  2  expresses  the  background  context  K  —  (L.D),  with 
£  =  {Vj.  AP{i)  D  P(j),  Vj.  RB{t)  D  B{x)} 
and 

D  =  {Fix)  B{x).P{x)  ^F{x),B{x)  ^  F{x)}, 

where  we  might  wish  to  interpret  P,  B,  AP,  RB  and  F  as  standing  for  the  predi¬ 
cates  penguin,  bird,  arctic  penguin,  red  bird  and  flies,  respectively.^  Given  that  Tim 
is  a  penguin,  P(Tim).  we  obtain  arguments  supporting  both  -iF{Tim)  amd  F{Tim). 
Arguments  supporting  the  former  correspond  to  the  path  P  ■/*  F  in  the  figure,  and 
require  a  (minimal)  support  S  =  {P{Tim)  — *•  -'F(Tim)};  arguments  for  F{Tim)  on 
the  other  hand,  correspond  to  the  path  P  —*  B  —*  F ,  aind  have  support  S'  = 
{P(Tim)  — +  B{Tim),  B{Tim)  —*  F{Tim)}. 


fl8  P 


t 

AP 

Figure  2:  Paths  and  Arguments 

We  are  interested  in  determining  whether  a  default,  say  P{Tim)  —*  ^FiTim). 
can  be  assumed  to  hold  once  we  take  into  account  an  additional  body  of  e^^dence 
E.  or  what  amounts  the  same,  to  determine  whether  to  authorize  the  conclusion 
E.  P[Tim)  -iF(Tim). 

Note  that  requiring  the  non-arguabihty  of  the  negation  of  the  default  consequent. 
F{Tim).  in  the  context  [E  U  {P(rirn)}}/v-,  is  too  strong  a  condition.  For  instance,  con- 

^This  notion  of  arguability  can  be  compared  to  the  notion  of  extension  in  Reiter's  default  logic  [Reiter 
80] .  It  is  not  difficult  to  show  that  the  existence  of  an  extension  that  sanctions  a  given  proposition  makes 
the  proposition  arguable.  It  does  not  work  the  other  way  though.  proposition  might  be  arguable  and  still 
not  be  present  in  any  extension.  This  is  because  Reiter's  logic  does  not  handle  reasoning  by  case^ 

'We  have  encoded  ‘penguins  are  birds'  as  a  default  rather  than  ais  a  class  inclusion,  simply  to  make  the 
discussion  below  more  illustrative. 
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sider  E  =  {AP(Tim)}.  Clearly  we  have  AP{TiTn),P{Tim)  11^  F{Tim),  with  S'  as  defined 
above,  as,  in  particular,  we  also  have  P(Tim)  Itj/  F{Tim).  Still,  there  is  no  question  that, 
with  the  information  available,  the  conclusion  -'F{Tim)  should  be  preserved. 

We  shall  base  the  criterion  for  determining  the  persistence  of  a  given  default  consequent 
upon  learning  a  body  of  evidence  E'  not  on  whether  E'  renders  its  negation  arguable,  but 
on  whether  E'  provides  additionsd  support  to  it.  This  latter  notion  is  defined  its  follows. 

We  say  that  the  body  of  evidence  E'  provides  additional  support  to  a  sentence  h  in 
context  Ek,  K  =  {L,D),  if  there  is  a  consistent  support  S,  S  C  D,  in  {P  U  E'}k,  such 
that  E,L  \^h  and  E,L\f^h.  If  E'  does  not  provide  additional  support  to  h  in  Ek,  we 
say  that  E'  is  irrelevant  to  h  in  such  context,  and  write  lK{h;  E'\E). 

In  the  example  above  for  instance,  we  have  that  E'  =  {AP(Tim)}  does  not  provide 
additional  support  to  either  F{Tim)  or  -<F(Tim)  in  context  {P(Ttm)}/c,  because  in  order 
to  render  either  proposition  airguable,  E'  requires  a  support  including  either  the  default 
B(Tim)  —>  F{Tim),  or  the  default  P{Tim)  -+  -<F{Tim)  which,  in  turn,  render  F{Tim) 
and  ->F(Tim),  respectively,  arguable,  in  {P(Tim)}K- 

Note  that  in  terms  of  the  graphs  we  have  been  using  to  represent  the  relationships 
embodied  in  a  given  background  context,  for  a  body  of  evidence  E'  to  provide  additional 
support  to  a  proposition  /i  in  a  context  Ek,  it  must  be  usually  the  case  that  there  is 
some  type  of  path  connecting  formidas  in  E'  to  h,  which  is  not  mediated  by  E.  Such 
graphical  interpretation  correctly  suggests  for  instance,  that  while  AP{Tim)  does  not 
provide  additional  support  to  F{Tim)  in  context  {P{Tim)}Ky  the  sentence  RB{Tim) 
does.  We  shall  often  find  useful  to  appeal  to  such  interpretation  when  we  consider  some 
examples  below. 

We  are  ready  now  to  provide  a  reasonable  sufficient  condition  under  which  a  given  de¬ 
fault  a  h  can  be  assumed  to  authorize  inferring  b  from  a,  in  the  presence  of  an  additional 
body  of  evidence  E: 

Rule  6’  (Explicit  Irrelevance)  If  a  — ►  6  e  £>  and  lK{~'b\  E|{a}),  then  a,  E  6. 

This  condition  attempts  to  capture  the  intuition  expressed  above  by  which  a  de¬ 
fault  a  — >  6  is  understood  as  providing  a  re2ison  for  concluding  b  from  a  as  long  *s 
no  reason  for  not  doing  so  is  apparent.  Such  new  rule  permits  to  derive  for  instance 
P{Tim),  AP(Tim)  )^-'F(Tim)  and,  therefore,  by  riiles  2  and  4,  AP{Tim)  yf^-^F{Tim). 
Note  that  neither  conclusion  was  derivable  by  means  of  rules  1-5  alone. 

Still,  rule  6’  is  not  strong  enough.  While  we  can  conclude  for  instance,  that  penguins, 
arctic  penguins  and  even  ‘penguin  birds’  are  likely  not  to  fly,  we  are  still  unable  to  conclude 
that  a  penguin  who  happens  to  be  a  red  bird  is  also  likely  not  to  fly.  The  reason  is 
that,  unlike  B{Tim)  or  APiTim),  RB{Tim)  cannot  be  shown  either  to  be  derivable  from 
P{Tim)  or  to  be  irrelevant  to  F{Tim)  in  context  {P(Tim)}K:.  However,  considering  the 
argument  by  which  RB{Tirri)  provides  additional  support  to  FiTivn)^  we  see  that  it  is 
not  the  ‘redness’  of  Tim  that  casts  doubt  about  its  inability  to  fly,  but  its  ‘birdness’;  even 
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when  ‘penguin  birds’  can  be  shown  not  to  fly.  Rule  6  below,  strengthens  and  generalizes 
rule  6’,  precluding  a  body  of  evidence  E  to  defeat  a  default  a  b  on  the  grounds  of  a 
property  /  known  to  be  irrelevant  to  such  a  default: 

Rule  6  (Implicit  Irrelevance) 

For  any  default  a  —*  b  in  D,  formula  /  amd  body  of  evidence  E, 

If  a  /  ,  a,E\jif  and  I K{~'b-,E\{a,f}),  then  a,E\;^b. 

Rule  6’  is  a  special  case  of  rule  6  in  which  /  =  true.  Note  that  for  non-tautological 
formulas  rule  6  imposes  the  additional  requirement  that  a^E  f  must  hold;  otherwise,  in 
particular,  we  could  choose  /  to  be  b  itself,  and  thus,  incorrectly  sanction  a,E\j^b  for  any 
body  of  evidence  E  consistent  with  L  U  {a,  6}. 

In  the  example  depicted  in  fig.  2,  we  can  now  show  P{Tim),  RB{Tim)  Ijj  ->F{Tim)  by 
invoking  rule  6  with  /  =  B{Tim). 

We  shall  illustrate  next  how  the  interpretation  of  defaults  embodied  by  the  system  of 
rules  1-6  usually  leads  to  ‘intuitive’  conclusions,  by  analyzing  in  detail  several  examples 
reported  in  the  AI  Uterature. 


3  Examples 


As  argued  above,  we  encode  generic  knowledge  of  the  domain  of  interest  in  K  and  include 
in  E  properties  and  relations  among  individuals.  Furthermore,  as  in  each  exrunple  below 
we  shall  be  dealing  with  single  place  predicates  and  a  single  individual,  say  a,  we  will  find 
convenient  to  abbreviate,  the  literal  [->]p{a),  for  any  property  p,  simply  by  [-']p.  Likewise, 
we  will  find  useful  to  label  default  schemas  P(i)  — >  Q{x)  with  a  neune,  say  di,  2ind  use 
such  label  to  refer  to  the  default  of  interest;  P{a)  Q{a)  in  this  case.  Furthermore,  when 
no  ambiguity  results,  we  eliminate  unnecessary  brackets,  as  in  Ik’Io;  which  we 

abbreviate  zis  /A  (a;  b\c). 

Example  2.  Let  us  consider  the  background  context  K  =  {L,D),  with  L  =  {}  and 
D  =  {  di  :  u^tudent{x)  — ►  adult{x),  d2  :  adult{x)  —*■  work{x), 

da  :  u^tudent(x)  ->■  ^work{x),  d4  :adult{x)  /\  under  J22{x)  u^tudent{x)]. 


adult— ►  work 


Figure  3:  Adults  under  22  usually  do  not  work 
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We  want  to  show  that  if  all  that  we  know  about,  say  Tom,  is  that  he  is  an  adult  under 
22  years  old,  then,  with  high  hkehhood,  we  can  conclude  that  Tom  does  not  work.  The 
proof  proceeds  as  follows:* 


1. 

adult,  under ^2  Ig-  ujstudent 

rule  1;  d4 

2. 

u.student,  under ^2  -'work 

rule  6’;  da,  Ifd-'work;  under. 22\u.student) 

3. 

u. student,  under  J12  adult 

rule  6’;  di,  I[({-'adult;  under J22\u.student) 

4. 

u.student,  under  J22,  adult  Ijj.  -'work 

rule  3;  3,  2 

5. 

adult,  under  J22  Ij^  -'work 

rule  4;  4,  1. 

Note,  for  instsmce,  that  irrelevance  of  under J12  to  -iwork  in  context  {u^tudent}f(, 
in  hne  2,  follows  from  the  fact  that  any  support  that  renders  -iwork  arguable  in  context 
{u^tudent,  under .22} K  must  include  the  default  ds,  thus,  rendering  ->work  arguable  also 
in  context  {u.atudent} k .  In  more  graphical  terms,  u.atudent  blocks  the  single  path  that 
leads  from  nodes  in  {u .student,  under J2.2}  to  ->work. 

It  is  interesting  to  note  that  from  the  same  background  knowledge,  we  can  also  derive 
that  an  arbitrziry  chosen  adult  is  Ukely  not  to  be  a  university  student: 

1.  u .student  \^-<w or k  rule  1;  d3 

2.  u.student  ^  adult  rule  1;  di 

3.  u. student,  adult  1^  -<work  rule  3;  2,  1 

4.  adult  Ig.  work  rule  1;  dj 

5.  adult  l^-  -'U.student  theorem  3;  3,  4. 

Example  3.[Sandewal  86,  Touretzky  et.  al.  87].  Let  K  =  {L,D)  be  given  as: 

L  =  \yx.royal.elephant{x)  D  elephant(x),'Vx.african.elephant{x)  D  elephant{x)}, 
D  =  {  di  :  elephant{x)  —*  gray(x),  d2  :  royal.elephant(x)  —*■  -<gray(x)}. 


african  elephant  royal_elephant 
Figure  4:  A  royal  african  elephant  is  not  gray 

We  want  to  show  that  a  royal  africam  elephant  is  hkely  to  be  not  gray: 

1.  r oyal.e I ephant  elephant  rule  2 

2.  royal.elephant,african.elephant  ^elephant  rule  2 

3.  royal.elephant,  african.elephant  -'gray  rule  6;  82,  1,  2,  Ik{-) 

The  last  step  uses  the  fact  lK{-^gray;  a frican]{royal,  elephant}),  which  can  be  under¬ 
stood  as  carrying  the  implicit  assumption  that  the  default  ‘most  royal  elephants  are  not 

«We  implicitly  use  the  results  of  theorems  1-3  to  freely  change  the  order  of  conjuncts  both  to  the  left  and 
to  to  right  of  the  provability  symbol  ‘  1^  ’• 
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gray’  also  holds  among  african  elephants.  We  assume  that  if  this  were  not  the  case,  the 
default  set  D  in  the  background  context  would  be  modified  accordingly,  either  by  expUcitly 
asserting  that  most  afncan  elephant  are  gray,  or  by  qualifying  the  default  that  states  the 
most  royal  elephants  are  not  gray.  In  either  case,  the  conclusion  we  have  derived  in  this 
context  would  be  blocked. 

Example  4.  [Touretzky  et.  al.  87].  Let  us  consider  now  K  =  {L,D),  with: 

L  =  {}, 

D  =  {d,:A{x)-^B(x),d2:A{x)-*^G{x),d3:B(x)^Gix), 
d^  :  B{x)  Cix),  ds  :  C(x)  Fix),  de  :  G(x)  ^  -F(x)}. 


A 


Figure  5:  A’s  are  F's 

The  goal  here  is  to  derive  that  A’s  are  F's.  The  intuition  is  to  show  that  both  C  and 
-’G  foUow  from  A,  and  that  the  latter  blocks  A  from  -'F.  The  proof  proceeds  as  follows: 

1.  C,^G,A^^F  ruled’;  ds, //c(-F;{-G,>l}lC) 

2.  A  -'G  rule  1;  d2 

3.  Af^B  rule  1;  di 

4.  B,A^C  tule  6';  di,  If({C;  A\B) 

5.  A  C  rtile  4;  4,  3 

6.  A^CA-'G  theorem  1;  2,  5 

7.  A^F  rule  4;  1,6. 

Note  that  {--G,  >1}|G)  holds  due  to  the  fact  that  the  presence  of  -'G  rules  out 

any  support  which  entails  G. 

Example  5.  We  consider  now  K  =  {L,D),  with  L  =  {},  and  D  = 

{quaker{x^  pacifi3t{x),republican(x)  — ►  ->paci/zst(x)}. 

Given  that  Nixon  is  both  a  quaier  and  a  repubhcan,  no  conclusion  can  be  drawn 
regarding  his  pacifism.  In  our  opinion,  drawing  no  conclusion  is,  in  this  case,  preferred 
to  drawing  two  conflicting  extensions,  as  in  normal  default  theories  [Reiter  80].  It  clearly 
indicates  that  the  knowledge  embedded  in  K  is  not  sufficient  to  integrate  the  available 
pieces  of  evidence  to  arrive  at  a  conclusion.  Enhancing  the  background  context  to  include 
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ajiother  default,  like  quakers  who  also  are  republicans  are  still  pacifists,  would  solve  the 
ambiguity  without  introducing  any  inconsistencies. 

Example  6;  (M.  Ginsberg)  Let  us  consider  the  background  context  K  =  {L,D),  with: 

L  =  {} 

D  =  {  di  :  quaker{x)  — ►  dove{x),  d2  :republican{x)  hawk{x), 

ds  :dove{x)  — ►  -<hawk{x),  d^  :  hawk{x)  — >  -<dove(x), 
ds  :  dove{x)  — >  pjnotivated{x),  de  ;  hawk{x)  — »  pjrnotivated{x)} 


p_motivated 


quaker  republican 
Figure  6:  Is  Nixon  pohticedly  motivated  ? 

The  conclusion  that  Nixon  is  politically  motivated,  given  that  he  is  both  a  quaker  and 
a  repubhcan,  would  follow  if  we  could  derive  that  he  is  either  a  hawk  or  a  dove.  However 
the  latter  disjunction  does  not  follow  from  rules  1-6,  since  D  does  not  provide  sufficient 
reasons  for  believing  either  that  quakers  who  are  also  republicans  are  still  likely  to  be 
doves,  or  that  repubUcans  who  tire  also  quakers  are  still  likely  to  be  hawks.® 

Example  7.  Let  us  finally  consider  a  background  context  K  =  {L,D),  with  Z.  =  {}  and 
D  =  {>l(x)  — +  C{x),A{x)  — >  B{x),A{x)  aC{x)  — +  -<B{x)}.  This  context  turns  out  to  be 
inconsistent.  D  entitles  us  to  conclude  both  B  and  C  from  A,  tind  -'B  from  A  and  C,  in 
contradiction  with  rule  3,  which  permits  to  maintain  conclusions  derived  in  a  context  Ek, 
in  the  enhanced  context  {E  Li  {f}}^,  when  /  is  itself  one  of  the  ‘expected’  consequences 
of  Eh'-  Reiter’s  default  logic  on  the  other  hand,  would  not  detect  any  inconsistency  in 
such  knowledge  baise.  Note  that  in  our  framework,  a  context  comprising  the  sets  L'  =  {}, 
D'  =  {-4(x)  —*  B{x)}  and  E'  —  {>l(a), ->5(0)}  is  perfectly  consistent. 


4  Related  Work 

As  noted  in  [Reiter  ei.  al.  81],  the  logic  for  default  reasoning  proposed  in  [Reiter  80] 
requires  exceptions  to  be  explicitly  stated  in  order  to  prevent  the  multiplicity  of  spurious 

®lf  this  lack  of  commitment  seems  counter-intuitive  it  is  because  the  information  contained  in  the  fact 
that  ‘typically  republicans  are  politically  motivated’  (independently  of  whether  they  are  hawks  or  doves) 
has  not  been  encoded  in  the  background  context.  In  fact,  if  we  replace  ‘politically  motivated’  by  ‘having  an 
extreme  position  on  defense  issues’,  not  drawing  a  conclusion  seems  to  be  the  most  reasonable  choice. 
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extensions.  Recently,  several  novel  systems  of  defeasible  inference  have  been  proposed, 
motivated  by  the  intmtion  that  it  should  be  possible  to  filter  the  effect  of  spurious  exten¬ 
sions  without  the  need  to  make  exceptions  explicit.  Among  them,  the  system  closest  in 
spirit  to  the  scheme  proposed  in  this  paper  is  the  system  of  defeasible  inference  proposed 
by  Loui.'° 

Loui's  system  [Loui  87b]  is  made  up  of  a  set  of  rules  to  evaluate  arguments.  He  defines  a 
set  of  (syntactic)  argument  attributes  (like  "has  more  evidence’,  ‘is  more  specific’,  etc.),  and 
a  set  of  rules,  which  allow  the  comparison,  evaluation,  and  selection  of  arguments.  This 
set  of  rules  seems  to  implicitly  embed  most  of  the  inference  rules  that  define  our  system, 
and  can  be  mostly  justified  in  terms  of  them.  Still,  it  is  possible  to  find  some  differences. 
One  such  difference  is  that  Loui’s  system  is  not  (logicaJly)  closed.  It  is  possible  to  believe 
propositions  A  and  B,  and  still  fziil  to  believe  AaB  [Loui  87b].  In  our  scheme,  the  closure 
of  the  propositions  believed  follows  from  theorems  1  and  2.  In  particular,  if  the  arguments 
for  A  and  R  in  a  given  theory  are  completely  symmetric,  and  A  A  B  does  not  follow  for 
some  reason  (like  conflicting  evidence),  then  neither  A  nor  B  will  follow. 

Another  difference  arises  due  to  the  absolute  preference  given  by  his  system  to  argu¬ 
ments  based  on  ‘more  evidence’.  As  the  following  example  shows,  this  criterion  might 
lead  to  counter-intuitive  results.  Consider  the  context  K  =  {L,D)  with  L  =  {]  and  D  = 
{A  — >  B,C  — V  ->5,  .4.  A  F  — >  C};  Loui’s  system  would  conclude  ~>B,  given  the  evidence 
E  =  {A,  F},  merely  because  the  evidence  supporting  the  argument  A  B,  constitutes  a 
proper  subset  of  the  evidence  supporting  the  competing  argument  AaF  —*  C  —*  ->B.  Yet, 
if  proposition  C,  whose  truth  was  presumed  in  the  argument  supporting  -'B,  were  learned. 
Loui’s  system  would  retract  its  belief  in  -’B,  since  C  renders  both  F  and  A  irrelevant  to 
-iB  and,  therefore,  neither  the  argument  which  supports  B,  nor  the  argument  that  sup¬ 
ports  -iB,  could  be  said  to  be  based  on  ‘more  eridence’  than  the  other.  Our  system,  as 
expected,  will  draw  no  conclusion  in  either  case,  since  the  joint  influence  of  both  A  and  C 
on  B  (or  -’B)  cannot  be  derived  from  the  given  context. 

The  system  reported  by  Touretzky  in  [Touretzky  84]  was  motivated  by  the  goal  of 
providing  a  semantics  for  inheritance  hierarchies  with  exceptions.  He  argues  that  there  is 
a  natural  ordering  of  defaults  in  inheritance  hierarchies  that  c^m  be  used  to  filter  spuri¬ 
ous  extensions.  In  this  way,  his  system  succeeds  in  capturing  inferences  that  seem  to  be 
reasonable,  but  which  escape  unaided,  fixed-point  semantic  systems  like  Reiter’s.  Still, 
Toureztky’s  system  cem  be  regarded  more  eis  a  refinement  of  Reiter’s  logic  than  as  a  de¬ 
parture  from  it  (see  [Etherington  87]).  As  such,  it  still  requires  testing,  outside  the  ‘logic’, 
whether  a  given  proposition  holds  in  every  (remaining)  extension.  Moreover,  requirements 
of  acyclicity  aje  at  the  heart  of  the  definition  of  the  inferential  distance  principle,  restrict¬ 
ing,  therefore,  its  range  of  applicabiUty.  It  is  interesting  to  note  that  rules  3  and  6  seem 
to  convey  ideais  very  similar  to  Touretzky’s  inferential  distance.  StiU,  while  the  inferential 

*°In  [Delgrande  87],  Delgrande  builds  a  logic  of  defaults  based  on  a  first  order  conditional  logic  which 
renders  a  core  of  rules  similar  to  our  rules  1-.5,  except  for  the  fact  that  he  makes  defaults  part  of  the 
object  language.  Out  systems  differ  mainly  in  the  semantics:  ours  is  probabilistic,  his  is  based  on  possible 
worlds.  This  difference  motivates  a  different  set  of  intuitions  and  proposal  for  approaching  the  problem  .,i 
characterizing  irrelevance. 
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distance  principle  is  used  to  discard  'inadmissible'  arguments,  the  rules  presented  in  section 
2  are  used  to  prevent  them  from  ever  evolving  to  a  ratified  conclusion. 

In  [Poole  85],  Poole  has  proposed  another  mechanism  for  dealing  with  the  problem  of 
multiple,  spurious,  answers  that  arises  in  Reiter’s  default  logic.  This  mecheinism  consists 
of  comparing  the  'specificity’  of  the  knowledge  embedded  in  the  eirguments  supporting 
contradictory  conclusions.  An  argument  shown  to  be  strictly  'more  general’  than  another 
argument,  can  be  discarded.  This  criterion  seems  in  fact  very  close  to  Touretzky’s  in¬ 
ferential  distance.  Still,  they  seem  to  differ  in  an  important  aspect.  Unlike  Touretzky, 
Poole  compares  the  specificity  of  the  arguments  isolated  from  the  rest  of  the  knowledge 
base.  It  seems  that  this  might  lead  to  undesirable  results.  For  instance,  in  example  2 
(fig.  3),  none  of  the  arguments  supporting  the  conclusion  that  Tom  works,  or  that  Tom 
does  not  work,  can  be  determined  to  be  more  specific  if  the  default  that  states  that  most 
students  are  adults —  which  does  not  take  part  in  the  competing  arguments —  is  ignored. 
Additionally,  like  Reiter’s  and  Toureztky’s,  Poole’s  system  seems  to  also  require  to  testing, 
outside  the  ‘logic’,  whether  a  proposition  holds  in  every  (remaining)  extension  in  order  for 
the  proposition  to  be  accepted. 


5  Summary 

The  main  contribution  of  the  proposed  framework  for  defezisible  inference  is  the  emergence 
of  a  more  precise,  proof-theoretic  and  semantic  account  of  defaults.  A  default  P  Q,  'm 
a  background  context  A',  represents  a  clear  cut  constraint  on  states  of  affairs,  stating  that, 
if  P  is  all  that  has  been  learned,  then  Q  can  be  concluded.  We  appealed  to  probabiUty 
theory  to  uncover  the  logic  that  governs  this  type  of  ‘context  dependent  implications’. 

•Additionally  we  have  introduced  a  notion  of  irrelevance  as  a  set  of  sufficient  conditions 
under  which  belief  in  the  consequent  of  a  given  default  cein  be  preserved  upon  acquiring 
new  information.  This  notion  is  used  very  much  hke  frame  axioms  Eire  used  in  AI:  we 
aissume  defaults  to  hold  upon  acquiring  a  new  piece  of  evidence  E,  sis  long  sis  E  does  not 
provide  a  reason  for  not  doing  so. 

The  scheme  proposed  here  avoids  the  problem  of  multiple,  spurious  extensions  that 
normally  arises  in  default  logics.  Moreover,  we  do  not  need  to  explicitly  consider  all  the 
extensions  in  order  to  prove  that  a  given  proposition  follows  from  a  given  theory.  Proofs 
in  our  system  proceed  ‘inside  the  logic’,  and  look  very  much  hke  proofs  constructed  in 
natural  deduction  systems  in  logic. 

I'he  system  is  also  clean:  the  only  appeal  to  ‘provabihty’  in  the  inferential  machinery  is 
to  determine  when  a  proposition  can  be  safely  assumed  to  be  irrelevant  to  another  propo¬ 
sition  in  a  given  context.  But,  in  contrast  to  most  non-monotonic  logics,  the  definition  of 
non-monoionic  provabihty  is  not  circular.  The  irrelevance  predicate  used  for  constructing 
proofs  can  be  inferred  syntacticEiUy  in  terms  of  arguments  only. 
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